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Abstract 

We study the activity, i.e., the number of transactions per unit 
time, of financial markets. Using the diffusion entropy technique we 
show that the autocorrelation of the activity is caused by the presence 
of peaks whose time distances are distributed following an asymptotic 
power law which ultimately recovers the Poissonian behavior. We 
discuss these results in comparison with ARCH models, stochastic 
volatility models and multi-agent models showing that ARCH and 
stochastic volatility models better describe the observed experimental 
evidences. 



1 Overview 

As is well known, financial time series present a strongly inhomogeneous time 
behavior. This is specially true when one considers either the volatility or the 
activity, i.e., the number of transactions per unit of time. Indeed if we look at 
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the variance of the return in a time window of, say, one day, we will observe 
periods of relative constant and regular behavior followed by other periods 
of strong variation of the price. In the same way there are days with few 
transactions and others where the number of trades is considerably larger. 
This great variability in the volatility or in the activity is generally referred 
to as volatility clustering or intermittency of volatility and activity. In this 
work we refer to both quantities. We will thus perform measures on the ac- 
tivity and use two volatility models: (i) ARCH models pQ and (ii) stochastic 
volatility (SV) models 00], where the relationship between volatility and 
activity is set by the usual assumption of proportionality between them [HE]- 
As is well known, the time interval or distance between two consecutive 
transactions r is a random variable described by a probability density func- 
tion (pdf) i/j(t) which in many cases presents an asymptotic power law of 
the form [HI Ej 



However ipfa) does not tell anything about the independence of consecutive 
r's. We note that if consecutive r's are independent a power law tail in ijjfa) 
can explain an inhomogeneous behavior in the number of events per unit of 
time, where a possible measure of this inhomogeneity is the distribution of 
the number of trades in a fixed period of time t. As Feller proved many years 
ago |H], if the time interval r between some particular events, which we will 
call markers, in a time series is distributed according to a given density ip(r) 
and the independence condition farj) = fa) fa) for i ^ j holds, then the 
probability distribution to observe a fixed number of these markers y in a 
given time interval, p(y,t), follows a scaling law of the form 



where 5 is some positive exponent and F(x) is a positive and integrable func- 
tion. With the help of a recently developed technique for the analysis of time 
series called Diffusion Entropy (DE) [0] , we will see that the scaling observed 
in the distribution of the number of transactions in a time interval does not 
correspond to Feller's analytical prescription obtained with the density ip(r) 
estimated from data. 

We are thus forced to make the additional hypothesis that consecutive 
r's are not independent. We will also assume that this correlation is due to 
the presence of peaks (or clusters) in the mean activity followed by periods 
of relative calm. Therefore the Tj's are positively correlated because during a 
peak of activity they are shorter than the mean value while away from a peak 
they are greater than the mean. Indeed, in such a case (fa — fa)) fa — fa))) > 
which implies a positive correlation: farj) > fa) fa). 
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Let r c be the random time distance between two consecutive peaks and 
denote by 0(r c ) its probability density function. Similarly to the distribu- 
tion of the distance between two consecutive transactions Tp{r), we will also 
assume that 0(t c ) obeys an asymptotic power law: 



In this scheme the results of the DE technique can be described directly 
in terms of the time distance and the magnitude of the peaks of activity, 
the latter described by a pdf h(x). We will see, like in ref. JT0], that the 
distribution of the size of the cluster given by h(x) does not play an important 
role, because the time distance distribution 0(r c ) is characterized by a more 
anomalous exponent than h(x). Consequently we will interpret the results of 
DE as a consequence of a non-Poissonian distribution of the distance between 
peaks of activity. 

There are in the literature several approaches that try to explain the au- 
tocorrelation of activity and volatility. One recent model [3JE1E! is based on 
the hypothesis that the intermittency of activity is caused by a subordination 
to a random walk, like in the case of the so called on-off intermittency [T2] . 
As clearly described in [5J, this procedure should give for the distribution of 
distances between clusters a scaling law of the form 



where f(t) is a cutoff function ensuring the existence of the first moment of 
4>(t c ) and A is the time scale at which this cutoff takes place. One simple 
choice for f(t) is given by the exponential f(t) = e~ l which allows that </>(t c ) 
presents an asymptotic Poissonian behavior. 

As we have already mentioned, other possible approaches to the problem 
of activity correlation are provided by ARCH models or SV models. We will 
show that both, ARCH and SV, models lead to a correlation in the volatility 
which more likely resembles to a power law tail exponent observed in a variety 
of financial markets. We will also show that a particular ARCH model, the 
TARCH model presented in [T3], and the SV model presented in [3 El both 
result in the same scaling law than that observed with the DE technique. 
Finally, and due to the absence of intra-day disturbances in the time series 
obtained by ARCH and SV models, we are also able to evaluate numerically 
the waiting time distribution 0(r c ) of the distance between peaks and this 
distribution is compatible with the empirical evidence given by a power-law 
behavior for a (long) transient period followed by a Poissonian (exponential) 
behavior. We incidentally note that this asymptotic exponential behavior 
indicates that very far clusters do not influence each other. 
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The paper is organized as follows. We start with a brief review on the DE 
technique and a simple analytical proof that the DE results are determined 
by the most anomalous power law tail between 0(r c ) and h(x). After that we 
show the results obtained by means of the DE technique on tick by tick data 
of a Foreign Exchange (FX) market. We also perform a filtering procedure 
on data in order to prove that the observed scaling is due to the anomaly 
in the waiting time pdf 4>(r c ) and not in the cluster size pdf h(x). Finally, 
we briefly describe the ARCH and the SV models and the results of the DE 
and the waiting time distribution on the time series constructed using these 
models. 

2 Diffusion entropy analysis 

The diffusion entropy technique is basically an algorithm designed to de- 
tect memory in time series jH]. DE is specially suitable for intermittent 
signals, i.e., for time series where bursts of activity are separated by peri- 
ods of quiescent and regular behavior. The technique has been designed to 
study the time distribution of some markers (or events) along the time series 
and thus discover whether these events satisfy the independence condition 
( T i T j) = ( T i)( T j) 7^ j) where r, is the time interval between the marker 
labeled i — 1 and the next one % [TUJ EH El E] • As marker we use here a 
very simple definition: each trade in the time series is a marker. In order 
to apply the DE technique we need to construct a new series & which is a 
function of a coarse grained time i ■ At (in our case At = 1 s) and where £j 
is precisely the number of transactions that occurred in the previous second. 
We next define a new random process through the following moving counting 
on & 



Note that yi(t) is precisely the number of markers (i.e., trades) in an interval 
of length t starting at position I. If we vary the value of I along the interval 
[0,N — t/At], where N is the total length of the sequence, we can obtain 
the probability density function, p(y,t), of this random process. It has been 
shown in [Oj that for the zero-mean process y — > y — (y{t)) and assuming that 
£i is a renewal process, then p(y,t) obeys the scaling law given by Eq. (J2J). 
Hence, the entropy of this random process reads |Hj 



yi(t) 



E 6- 



(5) 



i=l-t/M 



OO 



S(t) = - I p(y, t) In \p(y, t)]dy = A + 5 ln(t) 



(6) 
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Figure 1: Results of the DE analysis for US dollar - Deutsche mark futures 
market for different values of the time-threshold T. 



From the slope of S(t), in a logarithmic scale, we get an estimation of the 
scaling parameter 5. In Fig. ^we show the results obtained with DE on tick 
by tick data of the US dollar- Deutsche mark futures market from 1993 to 
1997 with a total of 1.3 x 10 6 data points (solid circles) ^Hj- The fit gives 
5 = 0.90. As is shown in [9 j if condition (TjTj) = (Tj)(ry) holds then there 
exists a relation between the scaling exponent 5 and the exponent /3 of the 
power law tail of the time distribution of markers (in our case, trades) ip(r) 
(see Eq. ((H)). This relation reads 



We have shown elsewhere that for the FX market under consideration the 
power law tail exponent of the waiting time distribution between trades is 
near 3.5 |B1 U\- According to Eq. (jJJ) this would lead to 5 = 0.5 — after 
a transient comparable with the mean trade distance of (r) = 23.6 s — in 
disagreement with the value 6 = 0.90 obtained by DE. 

We now present a picture which takes into account peaks of activity 
separated by periods with a low number of transactions. We see in Fig. El a 
schematic representation of this picture in which, for instance, the intensity 
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1/(13-1), if 2 < (3 < 3; 
0.5, if p>3. 
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Figure 2: Scheme of the model for the anomalous scaling for activity and 
volatility. 



23 of the third peak is represented by a black spot and this corresponds to 
the total number of transactions attributed to this peak. 

We suppose that the time intervals between peaks, r c i , are distributed 
according to a pdf 0(r c ) which asymptotically behaves as in Eq. @, i.e., 
4>(j c ) ~ We also assume that the intensity of a given peak, Xi, defined 

as the total number of transactions in the peak, has a distribution given by 
a pdf h(x). In the context of earthquakes h(x) is generally referred to as 
the Pareto law of the size of the earthquake clusters, since asymptotically 
h(x) ~ l/x a+1 as in the usual Pareto distribution [TU] . 

We will now present a proof that the DE technique perceives the most 
anomalous (i.e., the smallest) of the two exponents \i and a + 1. Indeed, 
let p(x, t) be the joint pdf for the waiting time r c between clusters and their 
intensity x. We denote by 

/oo /*oo 
dxe iuJX / dr c e~ STc p(x,T c ) 
-00 Jo 

its Fourier-Laplace transform. Observe that in terms of p(w, s) the Laplace 
transform of the waiting time distribution <ft(s) and the Fourier transform of 
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the size distribution h (u) are given by 

<f>(s) = p(uj = 0, s) and h(u) = p(u, s = 0). (8) 

We assume that the time duration of a peak is negligible with respect to the 
mean time distance between peaks (THj. In such a case we can use the Con- 
tinuous Time Random Walk (CTRW) formalism to calculate the probability 
density function, p(y,i), that the number of trades at time t is given by y. 
Thus, in terms of the joint distribution p(oJ, s) the Fourier-Laplace transform 
of p(y, t) is given by [HI U\ 

flu,, 8 ) = ^^-^ r. 9 

s 1 — p(u, s) 

We can easily see that as s — > (i.e., t — > oo) [1 — <p(s)]/s ~ (r c ), where 
(r c ) is the mean waiting time. Hence 

p(u,s)~ ^ (a-0). (10) 

1 - p(u, s) 

Note that, as s — > we have (see Eq. (jSJ)) 

}(s) ~ 1- (r^s + co^ 1 , (2<^<3). (11) 

Moreover, as c<j — > and consistently with the Pareto law according to which 
h(x) decays as l/x a+1 , we have 

~h(uj) ~l+i(x)uj + b Q uj a , (l<a<2), (12) 

where (x) is the average peak intensity. Taking into account Eqs. (fTT j) - (jT2]) 
we see that as s — > and uj —>■ the joint distribution p(ui, s) can be written 

as 

P(lo, s) ~ 1 — (t c )s + — i(xr c )ujs + b(s)uj a + c(cj)s m_1 , (13) 

where 6(s) and c(uj) are such that 6(0) = b and c(0) = cq. We recall that 
the DE technique measures the scaling in a "moving reference frame" where 
the average activity is zero, (y(t)) = for all t > 0. In order to obtain the 
pdf for y(t) in such reference frame we perform in Eq. (|13|) the following 
substitution 

s — + (14) 

( T c) 

and after applying the diffusive limit (t c )|s| (x)\u>\ we get, to the lowest 
order, 

p(u, s) ~ 1 - (r c )s + 6 ^ Q + c (i(a;}/(r c })' 1 - 1 ^- 1 . (15) 
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Substituting Eq. (JT5j) into Eq. (fTUj) finally yields 



s(t c ) -bu a -c(i{x)/{r c )Y 



i 



(16) 



This equation shows that the smallest exponent between a and fi — 1 deter- 
mines the asymptotic scaling of p(y, t) according to the exponent 



Therefore, the scaling perceived by DE is determined by the most anomalous 
exponent of the scaling between the size of the clusters of activity and the 
distribution of their time distances. Note that the case 5 = — 1) agrees 
with that of Eq. (jJJ). We also observe that we have proven this fundamental 
result for the most general case in which there is no assumption on the 
possible correlation, or independence, among intensities and waiting times. 

Having this in mind, we return to the problem of understanding the 
scaling exponent 5 = 0.90 appearing in the US dollar - Deutsche mark futures 
market. To what effect is due this scaling? In other words, is the exponent S 
determined by the time distance between clusters or by their size? In order 
to solve this question we impose a cutoff in the size of the peaks of activity 
by eliminating those transactions whose time distance from the previous one 
is below certain threshold T (note that this actually reduces cluster sizes 
because the number of transactions counted is now smaller). If after this 
cutoff procedure the scaling remains invariant then 6 would be determined 
by the time distances and not by the size of the clusters. In Fig. ^ the 
DE results are shown for different values of the time-threshold T ranging 
from to 80 s. We see there that the slope is practically unchanged which 
confirms the assumption that the exponent 5 = 0.90 is solely determined 
by the anomaly in the time distances between the clusters and not by any 
anomaly of their size. 

3 ARCH, Stochastic volatility and on-off in- 
termittency models 

At the end of the last section, we have indirectly shown that the anomalous 
scaling 5 = 0.90 observed in data is not caused by fat tails in the peak in- 
tensity distribution h(x) but by the anomalous scaling in the waiting time 
distribution between peaks. Another more direct way to prove this would 
have been to single out the peaks on real data and look for their time dis- 
tribution. Unfortunately it is very difficult, on real data, to define a peak of 




1/(^ — 1), if fi — 1 > a; 
1/a, if \i — 1 < a. 



(17) 
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activity and compute the waiting time distribution between them. This is 
because there are peaks of activity that appear at fixed times (we will call 
these "deterministic peaks" ) at the daily opening and closing sessions, at the 
opening during the day of other markets and even weekly at the opening 
of each monday [20] • These deterministic peaks do not contribute to the 
increase of entropy. However, they do affect any estimation of the waiting 
time distribution making it very difficult to get a reliable estimate of it. A 
possible way out from this situation would be to generate an artificial time 
series simulating the real market evolution. In this artificial series, the ac- 
tivity would be replaced by volatility following the accepted correspondence 
between them |3] and we would check there all the scaling phenomena 
reported up till now. 




0.01 0.1 1 10 100 1000 10000 
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Figure 3: Results of the DE analysis for US dollar - Deutsche mark futures 
with T = (solid circles), the TARCH model (empty circles), the SV model 
(diamonds), and finally for the on-off intermittency model as given by Eq. 
(jU) (crosses). 

We will follow this procedure and choose two well accepted models for 
reconstructing market activity without deterministic peaks: (i) the TARCH 
model [in] , and (ii) the stochastic volatility model presented in (22] • We will 
see that both models give the same results than those of the DE technique. 
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We finally discuss the prescription given in Eq. (£Q) based on multi-agent mod- 
els to see whether it agrees with our results or not. 

3.1 The TARCH model 

This particular ARCH model, called TARCH by its authors [13^ is given by 
a 2 t = k + aR 2 t _ 1 + pa^ + Qi-Rt-ihRti (18) 

Rt = CTtVt 

where a t is the volatility, R t is the one day return calculated at time t, Q(x) is 
the Heavyside step function and r\t is Gaussian noise with zero mean and unit 
variance. The other parameters, estimated from daily data of the Dow Jones 
Industrial Index from 1988 to 2000 and obtained in pQ, are: k = 0.0184, 
a = 0.0151, 7 = 0.0654, (3 = 0.9282 [21]. Using Eq. (JIHJ) we generate a 
time series for at. We then perform the DE analysis on this series (with a 
time step of 1 day) by supposing that the number of trades in the i-th day is 
proportional to <jj. The results are shown in Fig. El (empty circles) compared 
with the results on real data for T = (solid circles). We clearly see that 
the TARCH model predicts for t < tp a scaling exponent 5 = 0.90 which 
agrees with actual data. For t greater than a Poissonian time tp « 100 days 
the model yields 5 = 0.5. It is worth noticing that the change in the slope 
of real data is likely due to the lack of statistics. Moreover, we do not have 
enough data points to determine whether the change of slope takes place at 
the same time scale than in the TARCH model. Nevertheless, ARCH-type 
models (similar results were obtained with 7 = in Eq. (jl8j) ) seem to take 
into account the correct structure of the intermittency of financial series. 

From the series generated using Eq. (JTHJ) we can also evaluate the waiting 
time distribution between peaks because now we do not have deterministic 
peaks and other periodic effects that were present in actual data. The result 
is shown in Fig. |U and as we see there that for r c < tp a good fit is provided 
by the following power law: 

where v = 0.1 day _1 and \x = 2.1. For r c > tp a clear exponential (Poisson) 
behavior is present. This result has a simple physical explanation: if there 
is a first cluster at time t the probability to observe another one just after 
the first is high while very distant clusters are practically independent which 
explains the asymptotic Poisson observed behavior in Fig0J 



10 



0.1 




0.0001 







50 



100 



150 



200 



250 300 350 400 



Time (days) 



Figure 4: Waiting time distribution for the distance between clusters r c (in 
logarithmic scale) for the TARCH model (solid circles) and the SV model 
(empty circles). A distinct asymptotic behavior is clearly present: a power 
law tail with exponent /i ~ 2.1 and an exponential decay. 

3.2 An stochastic volatility model 

There exists another way of modelling volatility clustering. The so-called 
stochastic volatility models El HI] are an alternative choice to ARCH 
models and they are considered to be the most natural extension to the 
classic geometric Brownian motion for the price dynamics in continuous- 
time finance. Let us start with the zero-mean return X(t) (i.e., the log- 
price without drift) and whose dynamics is given by the following stochastic 
differential equation 



where this equation has to be understood in the Ito sense and a is the volatil- 
ity and £idt is the Wiener process, i.e., £i(t) is Gaussian white noise with 
zero mean and correlation function given by 



All SV models assume that the volatility a is itself a random process. There 
are several ways to describe the dynamics of the volatility 2\. One of the sim- 



X(t) = aUt) 



(19) 



UtMt')) = 8(t-t'). 



(20) 
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plest models, which still contains almost all the basic ingredients prescribed 
by real markets, is given by the Ornstein-Uhlenbeck (OU) process [H] 

a(t) = -a(a-m) + k&i(t). (21) 

One key property of this model is that it exhibits a stationary solution thanks 
to the existing reverting force — quantified by a — to a certain average m, the 
so-called "normal level of volatility" . The stationary solution is a Gaussian 
distribution and the resulting distribution for the return has fat tails ["""""] . 
In addition, stylized facts such as the negative skewness and the leverage 
correlation "T*f*. [3] require that the changes of the volatility be negatively 
correlated with the random source of return changes. In other words, the 
driving noises appearing in Eqs. ("IT"]) and (|2*T|) are anti correlated, that is: 

where — 1 < p < 0. For the OU SV as given in Eq. ()21j) the characteris- 
tic exponential time decay of leverage correlation is given by 1/a which is 
typically of the order of few trading days (see below) . 

Although the OU model has some disagreements with observations [2*2*] . 
it is complex enough to catch all the statistical properties that we are study- 
ing here. We therefore simulate the SV model with the parameters esti- 
mated from daily data of the Dow Jones Index from 1900 to 1999. Thus, 
the reverting force is equal to a = 0.05 days _1 , the noise amplitude is 
k = 0.0014 days _1 , the normal level of volatility reads m = 0.011 days ~ 1/ ' 2 
and the correlation coefficient is p = —0.5 [13]. The results of the DE anal- 
ysis are reported in Fig. [""] (diamonds) while the waiting time distribution 
between clusters is shown in Fig. """] (empty circles). In this case, and analo- 
gously to the TARCH model, we also observe a power law behavior followed 
by an exponential decay. The only difference is the value of the Poissonian 
time tp which for this model is near to 40 days while for the TARCH model 
is approximately 100 days. We have checked numerically that this difference 
is due to the fact that the parameters defining each model are estimated from 
the Dow Jones index over a different period of time, much larger for the SV 
model than for TARCH model. In any case, we cannot discard any of these 
approaches on the basis of the empirical results. 

3.3 On-off intermittency models 

The intermittent model of the activity is also predicted and studied by sev- 
eral multi-agent or minority game models jfij El "23- These models can be 
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connected to on-off intermittency and they generally imply that the persis- 
tency of activity is subordinated to a random walk which indicates that the 
waiting time distribution has the form given in Eq.(J!J). As suggested in [HIE] 
we also obtain the so-called variogram of the data, although the results de- 
note that, if a 3/2 tail is present like in Eq.(jlJ), causing a \/t behavior in the 
variogram, the tail lasts less than 5 days probably because the FX market is 
more liquid than the ones considered in jlj |3] . Furthermore the DE analysis 
performed on a series generated according to Eq. (J1J) also leads to a transient 
followed by the exponent 5 = 0.5 of the asymptotic behavior whereas, as is 
clearly seen in Fig. El the transient never exhibits the exponent 5 = 0.90 
beyond 1 day but presents a constantly increasing exponent which is most of 
the time greater than 1. 

4 Conclusions 

We have performed the DE analysis on the activity of the tick by tick time 
series US dollar - Deutsche mark futures from 1993 to 1997. The results 
clearly show the presence of an anomalous scaling, for the probability dis- 
tribution of the activity p(y,t), near the exponent 6 = 0.90. We have also 
implemented the same analysis on the volatility obtained with the TARCH 
model and with the OU stochastic volatility model. We find in both cases an 
excellent agreement between the scaling measured either on the actual data 
and on the constructed series. 

We compare the results with the scheme of the subordination of the 
volatility to a random walk leading to Eq. (JH) observing that a power law 
exponent fi « 2 for the tail of the distribution of the distances between peaks 
4>(t c ) of volatility is more plausible. We believe that the main reason why 
the TARCH and the SV models give better results is that in on-off intermit- 
tency models the occurrence of a peak can be considered as subordinated to 
a random walk but the weak restoring force (which has to be included in the 
model in order to describe mean reversion) not only causes the final station- 
arity and the Poisson tail of Fig. 0] but also affects the process of regression 
to equilibrium modifying in a fundamental way (from /x = 3/2to/x~2) the 
transient behavior of 0(r c ). 
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